Blar i NTNU Open på forfatter "Feiring, Patrick"
-
Deep Reinforcement Learning for Model-Free Continuous Control with an Emphasis on Trust Region Policy Optimization
Feiring, Patrick (Master thesis, 2017)Reinforcement learning is a general framework for optimizing the behavioural policy of an agent in an environment that issues a scalar reward indicating how well the agent is performing. Reinforcement learning algorithms ...